Method for improving the functionality of a binary representation

ABSTRACT

A method for improving functionality of a binary representation of an XML-based content description, wherein a structure of any instance of an XML-document corresponds to a tree-like data structure including a plurality of tree nodes, is provided. The method includes providing that each tree node represents an element of the content description and has a structure which is defined in a schema; providing the tree nodes in binary representation with tree branch codes; providing that the respective tree branch code has a schema branch code; extending an existing schema by utilizing unused schema branch codes in a tree node for extensions with new elements until the unused schema branch codes are all used up, and increasing a bit length of the schema branch code as necessary; and communicating a bit length change to a decoder for correctly decoding the content description in the binary representation.

PRIORITY CLAIM

This application is a continuation of application Ser. No. 10/451,592,titled “METHOD FOR IMPROVING THE FUNCTIONALITY OF THE BINARYREPRESENTATION OF MPEG-7 AND OTHER XML-BASED CONTENT DESCRIPTIONS,”filed Oct. 20, 2003, the entire contents of which are incorporatedherein.

BACKGROUND OF THE INVENTION

The present invention relates to the coding and decoding of structureddocuments based on XML, such as are provided for by MPEG-7, for example.XML (extensible markup language) is a standard for the definition ofdocument structures. It is used for representing structured data in atext file and forms the basis for the language XHTML, for example. Suchdocuments structured on an XML basis are based on a set of structuredelements, also referred to in the following as a “schema”, such as canbe specified using, for example, a document type definition (DTD), anXML schema, or multi-media description schemes (DS).

A draft paper from the ISO/IEC, namely CD 15938-1 InformationTechnology—Multimedia Content Description Interface: Systems, ISO/IECJTC 1 SC29/WG11/N3701, La Baule (France), October 2000, in particular onpages 15 to 22, discloses the binary format of MPEG-7 files and thestructure of navigation paths using tree branch code tables.

The present invention is concerned with the optimization of the codingof structured XML documents. An object of the present invention is,thus, to specify methods for improving the functionality of the binaryrepresentations of XML-based content descriptions, particularly forMPEG-7 documents, using which the data set to be transmitted will be assmall as possible, search operations within the document will be assimple as possible, and any extensions to an instanced document whichare not contained in the schema template can be effected with the leastpossible effort.

SUMMARY OF THE INVENTION

One of the consequences of the ISO/IEC draft mentioned above is that thestructure of an XML document can be interpreted as a data tree, whereeach element of the description corresponds to a node in this tree. Thestructure of the nodes is defined by the definition in the schema onwhich the document is based. In particular, the type and number ofchild-elements are defined by it.

These tree-structure nodes consist of the name of the element or complextype, a field with TBC words (Tree Branch Code), which are used forreferencing the child-elements, and the tree branches which representthe references to the appropriate child-elements.

It is also possible to deduce from the draft that the TBCs break downinto two components, namely a schema branch and an item of positiondata, where the schema data is derived from the elements which occur aschild-elements in the schema, while the item of position data containsthe position data for those elements which can occur repeatedly. Here,the possible types of child-element are elements of the type ComplexType, which can contain child-elements, or elements of type Simple Typeor Attribute, which cannot contain a child-element.

The length of the field #position is determined by the maximum number(“maxOccurs”) of the element concerned, which is specified in theschema. To cover the situation that, in the example here, the maximumnumber is greater than 7 or is unbounded, the field is lengthenedadaptively until it is possible to represent the position to be encoded.This breakdown has the property that the schema branch code or SBC#SchemaBranch code is always the same, regardless of how many childrenare or could be present in the current instantiation.

In order to move around within the document, the TBCs (i.e., theSchemaBranch codes and, where applicable, any PositionCodes), are put insequence, which produces a path in the document. When the desiredelement is reached, the last code is inserted in the table. If thedesired element cannot have any more children (i.e., is it is anAttribute or a Simple Type), then this termination code is unnecessaryand is not sent. In this case, the Attribute or the Simple Type elementis then transmitted in coded form.

Additional features and advantages of the present invention aredescribed in, and will be apparent from, the following DetailedDescription of the Invention and the Figures.

BRIEF DESCRIPTION OF THE FIGURES

FIGS. 1 a to 1 d show the addressing of various element types to assistin explaining the improvement in compression.

FIG. 2 shows an XML schema text.

FIGS. 2 a and 2 b show the node tables for the schema text in FIG. 2.

FIGS. 3 a and 3 b show diagrams to explain an improvement in the searchpossibilities in accordance with the present invention.

FIGS. 4A and 4B show sections of the data stream to explain theimprovement in extensibility.

FIGS. 5 a and 5 b show representations of extended tree branch nodes toassist in explaining the improvement in extendibility.

FIGS. 6 a and 6 b are similar to FIGS. 2 and 2 b, but with extendedelements.

FIGS. 7 and 8 show a sequence for a decoder to skip over unknownelements.

DETAILED DESCRIPTION OF THE INVENTION

The present invention basically uses of two different schema branchcodes, of which one is far more frequently used and thus effects acompression, wherein the schema branch code and the position code arecombined and the bit length for the schema branch code is transmittedwith them. The search function is simplified in that the first partalone specifies the type of element referenced, and in that improvedextendibility is achieved on the basis of a schema version number whichmust be transmitted together with fixed predefined extension strategieswhich are also known to the decoder.

Improvement in Compression

FIG. 1 a shows a method used until now for addressing a Simple Typeelement or Attribute, and FIG. 1 b shows a method of addressing aComplex Type element in a way corresponding to the familiar methods.FIGS. 1 c and 1 d show the corresponding forms of address using themethod according to the present invention. This makes it clear that useis made of two different schema branch codes SBC-A and SBC-B, and notthe general schema branch code SBC-B alone. As initially mentioned, suchan address path consists of TBC codes chained together, wherein, ifnecessary, there also may be position codes #pos between the schemabranch codes SBC-A, and only at the end will there be a schema branchcode SBC-A with a path termination code and no further positionspecifications, followed by a general schema branch code SBC-B, whichalso may contain Simple Type elements or Attributes, which form theleaves of the tree-type structure.

From the structure described for the path, including chained TBC codes,it can be seen that only the last TBC of the path can refer to anAttribute or a Simple Type element. All the preceding TBCs must refer toComplex Type elements, because only these can have child elements. Inthe method according to the present invention, two different tables forthe #SchemaBranch codes SBC are now introduced for each node, with theobjective of reducing the length of the code for positioning within thedocument by comparison with that in the ISO/IEC draft mentioned above.Table A contains only the elements of Complex Type; that is, thoseelements which can have child elements. The other table contains all theelements; i.e., including the Attributes and the Simple Type elements.It should be noted that no SBC needs to be reserved for the pathtermination. The #SchemaBranch codes in the two tables are referred tobelow as SBC-A and SBC-B, respectively. The complete path is, in turn,formed by chaining together TBCs, with all the TBCs except the lastbeing formed using SBC-A and, where applicable, the appropriate#Position codes. The end of the first part of the path, created usingTable A, is signaled by a termination code; for example, all bits 1.There then follows exactly one TBC, the #SchemaBranch code for which istaken from Table B. It should be noted that, in the method according tothe present invention, the termination code also must be sent if anAttribute or a Simple Type element is addressed. As the length of the#SchemaBranch codes depends on the number of possible elements, thecodes in Table A (i.e., the SBC-A codes), are correspondingly shorter.The fact that the SBC-A codes are used significantly more frequentlythan the SBC-B codes also has an advantageous effect on the compression.

FIG. 2 shows an example of an XML schema text and FIGS. 2 a and 2 b showthe associated node tables for SBC-A and SBC-B. From these, it is clearthat the schema branch codes for the SBC-A can be shortened, becausehere there is no need to reference the Simple Type elements andAttributes.

Improved Search Function

A functionality which is required by the binary representation butwhich, with the method according to the ISO/IEC draft, cannot be usedwithout restriction is a simplified search in the document for certainelements. Ideally, it should be possible to perform this search using asimple filter mechanism, by using a comparison against a bit pattern tosearch in the bitstream for a predefined bit sequence which uniquelyaddresses the element sought in the document. To search rapidly for aparticular element in the document tree, the bitstream will then beparsed and closer attention will only be given to those elements whichare addressed by the correct path fragment. For the method as carriedout in the ISO/IEC draft, this type of filtering cannot be performedwithout restriction, because the length of the #Position codes cannot bedetermined in advance if there is at least one element in the schema forwhich the maximum number is greater than 7 or is unlimited.

In the method according to the present invention, the tree branchingnodes (TBCs) which specify the path are subject to a partial re-sorting,with the objective of enabling a simple filtering of the bitstream. Thismoves the #Position codes to the end of the path. The advantage of thisis that the first part of the path, which contains the #SchemaBranchfragments, by itself specifies the type of the element referenced.

In an alternative solution, a first step separates the #Position codesinto a part with a fixed length and a part with a variable length. In asecond step, the parts with variable length are taken out of the TBCsand are moved to the end of the path.

With absolute addresses it is then possible, when searching for aparticular element, to define the bit pattern in advance. When relativeaddresses are being used, the pattern depends on the current positionwithin the document. For this situation, the new methods produces asimplification in that the #Position codes do not need to be decoded andanalyzed for filtering purposes.

For a complete reference, the complete path including the complete#Position codes must be read and decoded to enable correct branching ateach node into the child-element referenced.

To simplify the implementation of this method, it is possible to send atthe start of the path a specification of the overall length L of thepath, typically in bits, excluding the #Position codes at the end, sothat a pointer Z for the #Position codes can be simultaneously updated.As such, the correct positions can be decoded in parallel with the SBCs.In addition, this also will make possible a search of particularpositions (#Positions) for the elements sought, and will support asearch in the case of extendibility, as explained below, in which a partof the path is not known to every decoder.

FIG. 3 a illustrates these relationships by an example of the addressingof a Simple Type element or Attribute, under the previous method. FIG. 3b shows the corresponding example for the method in accordance with thepresent invention. From FIG. 3 b it is clear that all the schema branchcodes SBC-B1 . . . SBC-B5 for any particular path are arranged one afteranother and in total have a length L, which is transmitted at the verybeginning as the first item. The position codes #pos 1 . . . #pos 5 areseparated from the SBCs and are arranged one after another. The bitpattern for absolute addressing with a bit length of L can be determinedfrom the schema definition, so that it is possible to filter thebitstream by comparing it against a bit pattern.

Improvement in Extendibility

The coding schema, on which the algorithm of the ISO/IEC draft is based,is context-sensitive; i.e., the coding in each element includes onlythose other possibilities defined by the context. The decoder can onlyread the bitstream and interpret it correctly if it knows the schemadefinition. The decoder must know which TBC code refers to whichelement, and how long the bit code in each element is, so that thecorrect number of bits is read for each path fragment.

A situation which will often arise in practice is that a defined schemais retrospectively extended, to take account of new conditions, such asnew categories of metadata. These extensions can be optional elements orattributes. Documents held in XML text form, which were produced inaccordance with the old schema definition, continue to be valid inrelation to the new definition (forward compatibility). However, thesealso can be data types derived by inheritance which, in the case of arestriction (derived by restriction), retain the TBCs or, for anextension (derived by extension), are given an extended TBC table asdescribed below.

However, in the binary representation of documents, such as are shown inthe ISO/IEC draft, for example, this is not the case, because here newelements/attributes can be allocated TBCs which previously addressedother elements/attributes. In the method according to the presentinvention however, this disadvantage can be avoided by using thefollowing rules:

New, optional, elements only can be inserted into the Tree StructureNodes (TSNs) following existing elements, and only before any PathTermination Codes which may be present. When this is done, these newelements will be assigned schema branch codes (SBCs) which have not yetbeen assigned. In doing so, any existing elements will not lose theirschema branch code assignments.

If the extension would lead to addressing which uses longer addresses,then any binary representations would no longer be decodable, due to thechange in the code length. In order to solve this problem, and inaccordance with the present invention the following addressing isintroduced:

With respect to the schema branching code, new elements/attributes areentered after existing elements/attributes and before any existing pathterminations in the tree structure nodes, TSN. In this case, if no moreschema branch codes are available, then the addressing will be extendedby one or more bits; for example, the most significant bit. For example,the existing codes will be extended by adding a zero. An exception isthe Path Termination Codes, which are extended by 1 so that they remainthe last code in the tree structure node. New elements/attributes arethen assigned corresponding to the newly available schema branch codes,SBC.

The change in the bit length for the schema branch code must be signaledto the decoder. In order to make possible incremental extensions,preceding versions of the schema must be known to the decoder.

For this purpose, it is not necessary to save all the information forthe versions concerned. Instead, only the bit length or the number ofschema branch codes for the new versions of an appropriately modifiedtree structure node need to be saved, and transmitted if necessary,where the second possibility may have advantages in enabling erroneouscodes to be recognized. These details must be transmitted before thecoded schema branch codes which have been changed. In this way, the bitlengths of the schema branch codes are linked to the version numbers ofthe schema. Before a document is binary encoded, it is then onlynecessary to specify the version of the schema used, and not to transmitthe entire schema used as has been necessary until now.

For example, the bitstream definition of the ISO/IEC draft can beextended by adding a field for specifying the version. If no versioncheck is carried out, a schema definition from a standard (for example,MPEG-7), can be used as a reliably known reference. This schemadefinition could be designated, for example, as version 1. An exemplaryembodiment of such version details is given below:

In this case, both the version details and the bit length informationare stored as additional items in the stream header, as this isspecified in the ISO/IEC draft. For this purpose, the data as shown inFIG. 4 a is stored in the datastream.

The standardized versions can be assigned a unique version identifier,which is designated M7_Version_ID in FIG. 4 a. Furthermore, proprietaryextensions can be identified by an extension identifier, which isdesignated Extension_ID in FIG. 4 a. This can be specified even if thebit lengths of the extended tree structure nodes TSN are stored in thebitsteam. As shown in FIG. 4 a, this is signaled by a flag,DS_Extension. The bit length information for the tree branch codes TBCof the extended tree structure nodes TSN is coded in the DS_Update_Info() specified in FIG. 4 a, as shown in FIG. 4 b. The expressionNumber_of_changed_nodes signals the number of tree structure nodes whichhave been changed. This number can be coded with a variable length,corresponding to the position data suggested in the ISO/IEC draft.

The information about the changed tree structure nodes can be addressedin the bitstream by a navigation command, Navigation_Command, and anavigation path, Navigation_(—) Path. The change details transmittedthereafter then apply for all elements which are of the same type as thenode addressed. After this, the changed codeword length SBC_Length orthe changed number of schema branch codes is inserted into thedatastream. The codeword length or number is again coded in accordancewith the same method as used for coding the Number₁₃ of_changed_nodes.

In a further exemplary embodiment, the changed tree structure nodes canbe identified by direct addressing of the Complex Types in the schema.This direct addressing can be achieved, for example, by numbering offthe Complex Types defined in the schema.

There is a further problem in that a document coded in accordance withthe new schema should be decoded by a decoder to which only the earlierschema definition is known (backward compatibility). In an XML textualdocument based on XML this is possible for elements which were alreadyknown under the old schema. This depends on two properties:

The elements of the complex type defined in the old schema continue tobe retained, but can differ in the elements and attributes or the datatypes, as applicable, which they contain.

By using start and end markers for the elements, so-called Tags, newelements can be skipped and known ones decoded.

If the bit length change is transmitted for different versions, asspecified in the above addressing proposal, then an “old” decoder whichis working on the basis of an earlier schema still can decode knownelements in an extended tree structure node. However, pathspecifications which lead to a new element cannot be skipped by the“old” decoder, and it cannot decode any further. In order to supportthis important functionality, the method according to the presentinvention uses the following alternatives for backward compatible codeddocuments:

-   -   a) If new elements/attributes are addressed in a TSN, then the        transmission includes, in addition, at the start, the number of        bits for the complete sub-tree or successor tree for this        element/attribute, including the N bit content data which has        been inserted. In this way, the decoder is enabled to skip over        the next N bits, which are coded in a way unknown to it, and to        land back in the known TSN again.    -   b) After the transmission of a path which contains a new        element/attribute, a unique synchronization sequence is        communicated, which the decoder can use to land back in the        known TSN again.    -   c) Before transmitting any paths which contain new elements,        their TSNs, which represent a part of a complete schema, must        first be transmitted.    -   d) Before transmitting any paths which contain new elements, the        complete schema must first be transmitted.

In the case of alternatives c) and d), the decoder also can decode thecontent of the newly appended documents and, where applicable, can saveit or subject it to further processing.

The example illustrated in FIGS. 5 a and 5 b shows the changes whenthere is a new version of a schema definition, with FIG. 5 a showing anextended tree structure node for a Complex Type element and FIG. 5 bshowing an extended tree structure node for a modified schema. Theelements 3 to 6 have been added in the new version. This causes thelength of the schema branch code to increase from two to three. However,the previously-existing addresses are retained, they are simply extendedby the addition of a zero as the MSB.

There follows an example of the coding of extended schema elements,shown in FIGS. 6 a and 6 b. Here, the example used in conjunction withFIG. 2 serves as the starting point. For reasons of simplicity, themethod described above for splitting up the node table is ignored inthis illustration. The original schema “PurchaseOrderType” is to beextended by a number of elements. In FIG. 6 a, the elements which areextensions compared to FIG. 2 are highlighted in boldface.

That is to say, the elements “billTo”, “MethodOfPayment” together with“BankData” are new insertions. The new tree branch code table thereforemust be correspondingly extended. As a consequence, three bits is nolonger sufficient to encode all the possibilities.

How this extension is effected using a tree branch code of four bits isitemized in more detail in FIG. 6 b.

Under these general conditions, two cases are now dealt with:

Case 1:

Documents which have been coded in accordance with the old schemadefinition are transmitted to a decoder to which the new schema isknown. The version number of the schema on which the coded document isbased must be the first item communicated to the decoder. In thisconnection, the decoder has a table, in which is stored for each versionnumber the bit width or the number of schema branch codes SBC for allthe elements. Using this, the decoder determines that elements of type“PurchaseOrderType” are coded not with four bits but rather with onlythree bits. This information by itself permits it to decode the documentcorrectly.

Case 2:

Documents which have been coded in accordance with the new schemadefinition are transmitted to a decoder which knows only the old schema.By reference to the version number of the schema, the decoder recognizesthat unknown elements could be transmitted and that known elements couldbe coded with a different bit width. The new bit width of the elementsmust be known to the decoder, or else it would lose its synchronizationwith the encoder. Either the information assigning a bit width to theindividual elements, such as a table, is transmitted before the actualdocument, or the decoder can access this information under a specifiedaddress (URI).

Pursuant to the method according to the present invention, the encoderhas four options for coding the document:

Option 1:

For each new element, the length of the corresponding sub-tree istransmitted, as shown in FIG. 7.

By reference to the schema branch code 0101, the decoder recognizes thatthe element addressed is not contained in the standard schema.Accordingly, it interprets the next bits as the length L of the unknownelement. This length could be specified in accordance with the adaptivevariable integer coding, as specified in the ISO/IEC draft. Using thislength specification, it skips over the “biliTo” sub-tree and resumes atthe #SchemaBranch code 0010. The following element “Command” is then onewhich it is again able to decode.

Option 2:

After a new element, a unique synchronization sequence is transmitted,as shown in FIG. 8. The decoder parses the bitstream until it finds aResyncmarker as defined by the standard, from which point it continuesthe decoding. This method offers the possibility of coding a number ofnew elements without a break, and only transmitting the resync markerafter the last of them.

Option 3:

The tree structure nodes which contain the new elements, together withtheir position in the document tree, are transmitted before the actualdocument. With this method, therefore, the schema which is known to thedecoder is updated. The transmission of the document then takes place inaccordance with the case that the schema is known. In addition, it canuse the newly-transmitted schema to add the new elements to the ones ofwhich it is internally aware, provided that a unique version number isassigned to identify the new schema.

Option 4:

A complete new schema is transmitted. In this case, the decoder canhandle the document like one coded in accordance with a known schema. Inaddition, it can use the newly-transmitted schema to add the newelements to the ones of which it is internally aware, provided that aunique version number is assigned to identify the new schema.

The individual methods in accordance with the invention can be usedindependently of each other or in combination.

Although the present invention has been described with reference tospecific embodiments, those of skill in the art will recognize thatchanges may be made thereto without departing from the spirit and scopeof the present invention as set forth in the hereafter appended claims.

1-3. (canceled)
 4. A method for improving functionality of a binaryrepresentation of an XML-based content description, wherein a structureof any instance of an XML-document corresponds to a tree-like datastructure including a plurality of tree nodes, the method comprising thesteps of: providing that each tree node represents an element of thecontent description and has a structure which is defined in a schema;providing the tree nodes in binary representation with tree branchcodes; providing that the respective tree branch code has a schemabranch code; extending an existing schema by utilizing unused schemabranch codes in a tree node for extensions with new elements until theunused schema branch codes are all used up, and increasing a bit lengthof the schema branch code as necessary; and communicating a bit lengthchange to a decoder for correctly decoding the content description inthe binary representation.
 5. A method for improving functionality of abinary representation of an XML-based content description as claimed inclaim 4, the method further comprising the step of saving at least oneversion of the schema in the decoder, and performing one of (a) sendingonly data for calling up a schema change relative to a saved version,and (b) transmitting a schema change relative to a saved versiondirectly. 6-8. (canceled)